Document Summarization and Classification using Concept and Context Similarity Analysis
نویسندگان
چکیده
“Document summarization and classification using concept and context similarity analysis’’ deals with an information retrieval task, which aims at extracting a condensed version of the original document. A document summary is useful since it can give an overview of the original document in a shorter period of time. The main goal of a summary is to present the main ideas in a document/set of documents in a short and readable paragraph. Classification is a data mining function that assigns items in a collection to target categories of the documents. Context sensitive document indexing model based on the Bernoulli model of randomness is used for document summarization process. The lexical association between terms is used to produce a context sensitive weight to the document terms. The context sensitive indexing weights are used to compute the sentence similarity matrix and as a result, the sentences are presented in such a way that the most informative sentences appear on the top of the summary, making a positive impact on the quality of the summary
منابع مشابه
Text Rank: A Novel Concept for Extraction Based Text Summarization
Indexing used in text summarization has been an active area of current researches. Text summarization plays a crucial role in information retrieval. Snippets generated by web search engines for each query result is an application of text summarization. Existing text summarization techniques shows that the indexing is done on the basis of the words in the document and consists of an array of the...
متن کاملA survey on Automatic Text Summarization
Text summarization endeavors to produce a summary version of a text, while maintaining the original ideas. The textual content on the web, in particular, is growing at an exponential rate. The ability to decipher through such massive amount of data, in order to extract the useful information, is a major undertaking and requires an automatic mechanism to aid with the extant repository of informa...
متن کاملDocument Summarization Retrieval System Based on Web User Needs
Existing models for document summarization mostly use the similarity between sentences in the document to extract the most salient sentences. The documents as well as the sentences are indexed using traditional term indexing measures, which do not take the context into consideration. Therefore, the sentence similarity values remain independent of the context. In this paper, we propose a context...
متن کاملUsing Context Inference to Improve Sentence Ordering for Multi-document Summarization
In this paper, we propose a novel context inference-based approach for sentences ordering in mult i-document summarization application. Our method first detects whether or not two summarizat ion sentences should be adjacent according to the similarity between one summarizat ion sentence and the context of the other one, and then it computes the reliability of the adjacent summarization sentence...
متن کاملEXTRACTION-BASED TEXT SUMMARIZATION USING FUZZY ANALYSIS
Due to the explosive growth of the world-wide web, automatictext summarization has become an essential tool for web users. In this paperwe present a novel approach for creating text summaries. Using fuzzy logicand word-net, our model extracts the most relevant sentences from an originaldocument. The approach utilizes fuzzy measures and inference on theextracted textual information from the docu...
متن کامل